
ors and Matrices 



You've been using operators for years even if you've never heard the term. Differentiation falls into this 
category; so does rotation; so does wheel-alignment. In the subject of quantum mechanics, familiar 
ideas such as energy and momentum will be represented by operators. You probably think that pressure 
is simply a scalar, but no. It's an operator. 

7.1 The Idea of an Operator 

You can understand the subject of matrices as a set of rules that govern certain square or rectangular 
arrays of numbers — how to add them, how to multiply them. Approached this way the subject is 
remarkably opaque. Who made up these rules and why? What's the point? If you look at it as simply 
a way to write simultaneous linear equations in a compact way, it's perhaps convenient but certainly 
not the big deal that people make of it. It is a big deal. 

There's a better way to understand the subject, one that relates the matrices to more fundamental 
ideas and that even provides some geometric insight into the subject. The technique of similarity 
transformations may even make a little sense. This approach is precisely parallel to one of the basic 
ideas in the use of vectors. You can draw pictures of vectors and manipulate the pictures of vectors and 
that's the right way to look at certain problems. You quickly find however that this can be cumbersome. 
A general method that you use to make computations tractable is to write vectors in terms of their 
components, then the methods for manipulating the components follow a few straight-forward rules, 
adding the components, multiplying them by scalars, even doing dot and cross products. 

Just as you have components of vectors, which are a set of numbers that depend on your choice 
of basis, matrices are a set of numbers that are components of — not vectors, but functions (also 
called operators or transformations or tensors). I'll start with a couple of examples before going into 
the precise definitions. 

The first example of the type of function that I'll be interested in will be a function defined on 
the two-dimensional vector space, arrows drawn in the plane with their starting points at the origin. 
The function that I'll use will rotate each vector by an angle a counterclockwise. This /'s a function, 
where the input is a vector and the output is a vector. 



What happens if you change the argument of this function, multiplying it by a scalar? You know 
/(■u), what is f{cv)7 Just from the picture, this is c times the vector that you got by rotating v. 
What happens when you add two vectors and then rotate the result? The whole parallelogram defining 
the addition will rotate through the same angle a, so whether you apply the function before or after 
adding the vectors you get the same result. 

This leads to the definition of the word linearity: 



f{vi + V2) 




f{cv) = cf{v), 



and 



f{Vl+V2)=f{Vl)+f{V2) 



(7.1) 



Keep your eye on ttiis pair of equations! They're central to the whole subject. 
James Nearing, University of Miami 1 
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Another example of the type of function that I'll examine is from physics instead of mathematics. 
A rotating rigid body has some angular momentum. The greater the rotation rate, the greater the 
angular momentum will be. Now how do I compute the angular momentum assuming that I know the 
shape and the distribution of masses in the body and that I know the body's angular velocity? The 
body is made of a lot of point masses (atoms), but you don't need to go down to that level to make 
sense of the subject. As with any other integral, you start by dividing the object in to a lot of small 
pieces. 

What is the angular momentum of a single point mass? It starts from basic Newtonian mechanics, 

— * 

and the equation F = dp/dt. (It's better in this context to work with this form than with the more 
common expressions F = ma.) Take the cross product with r, the displacement vector from the origin. 

fx F = fx dp/ dt 

Add and subtract the same thing on the right side of the equation (add zero) to get 

^ ^ dp df ^ df ^ 

r X r = rx-r- + -r-xp tt x P 

dt dt dt 

d df 
rxp)-^^xp 



dt 



Now recall that p \s mv, and v = df/dt, so the last term in the preceding equation is zero because 
you are taking the cross product of a vector with itself. This means that when adding and subtracting 
a term from the right side above, I was really adding and subtracting zero. 

r X is the torque applied to the point mass m and r'x p is the mass's angular momentum 
about the origin. Now if there are many masses and many forces, simply put an index on this torque 
equation and add the resulting equations over all the masses in the rigid body. The sums on the left 
and the right provide the definitions of torque and of angular momentum. 



d 



-Ttotal = J2^k X Fk = i^k X Pk) 



dL 
~dt 



For a specific example, attach two masses to the ends of a light rod and attach that rod to 
a second, vertical one as sketched — at an angle. Now spin the vertical rod and figure out what 
the angular velocity and angular momentum vectors are. Since the spin is along the vertical rod, that 
specifies the direction of the angular velocity vector uj to be upwards in the picture. (Viewed from above 
everything is rotating counter-clockwise.) The angular momentum of one point mass is fxp = fxmv. 
The mass on the right has a velocity pointing into the page and the mass on the left has it pointing 
out. Take the origin to be where the supporting rod is attached to the axis, then fx p for the mass 
on the right is pointing up and to the left. For the other mass both r and p are reversed, so the cross 
product is in exactly the same direction as for the first mass. The total angular momentum the sum of 

these two parallel vectors, and it is not in the direction ^f the angular velocity. 

ri X miVi 




V2{out) 
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Now make this quantitative and apply it to a general rigid body. There are two basic pieces to 
the problem: the angular momentum of a point mass and the velocity of a point mass in terms of its 
angular velocity. The position of one point mass is described by its displacement vector from the origin, 
f. Its angular momentum is then fxp, where p = mv. If the rigid body has an angular velocity vector 
cJ, the linear velocity of a mass at coordinate r is x r . 




The total angular momentum of a rotating set of masses at respective coordinates fj^ is the 
sum of all the individual pieces of angular momentum 

L = X rrij^vi^, and since = cJ x r^, 

- (7.2) 
L = 2_]nc X mk{uj X fk) 

k 

If you have a continuous distribution of mass then using an integral makes more sense. For a given 
distribution of mass, this integral (or sum) depends on the vector uj. It defines a function having a 
vector as input and a vector L as output. Denote the function by I, so L = 

L = y dmrx (co x r) = I{u) (7.3) 

This function satisfies the same linearity equations as Eq. (7.1). When you multiply cJ by a 
constant, the output, L is multiplied by the same constant. When you add two cu's together as the 
argument, the properties of the cross product and of the integral guarantee that the corresponding L's 
are added. 

I{cuj) = cl{u), and /(wi + ^2) = /(wi) + /(a;2) 

This function / is called the "inertia operator" or more commonly the "inertia tensor." It's not simply 
multiplication by a scalar, so the rule that appears in an introductory course in mechanics {L = Icu) is 
valid only in special cases, for example those with enough symmetry. 

Note: / is not a vector and L is not a function. L is the output of the function / when you 
feed it the argument u. This is the same sort of observation appearing in section 6.3 under "Function 
Spaces." 

If an electromagnetic wave passes through a crystal, the electric field will push the electrons 
around, and the bigger the electric field, the greater the distance that the electrons will be pushed. 
They may not be pushed in the same direction as the electric field however, as the nature of the crystal 
can make it easier to push the electrons in one direction than in another. The relation between the 
applied field and the average electron displacement is a function that (for moderate size fields) obeys 
the same linearity relation that the two previous functions do. 



P = a{E) 
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P is the electric dipole moment density and E is the applied electric field. The function a is called the 
polarizability. 

If you have a mass attached to six springs that are in turn attached to six walls, 
the mass will come to equilibrium somewhere. Now push on this mass with another 
(not too large) force. The mass will move, but will it move in the direction that 
you push it? If the six springs are all the same it will, but if they're not then the 
displacement will be more in the direction of the weaker springs. The displacement, 

— * — * 

d, will still however depend linearly on the applied force, F. 

7.2 Definition of an Operator 

An operator, also called a linear transformation, is a particular type of function. It is first of all, a vector 
valued function of a vector variable. Second, it is linear; that is, if A is such a function then A{v) is a 
vector, and 

A{avi + l3v2) = aA{vi) + l3A{v2). (7.4) 

The domain is the set of variables on which the operator is defined. The range is the set of all values 
put out by the function. Are there nonlinear operators? Yes, but not here. 

7.3 Examples of Operators 

The four cases that I started with, rotation in the plane, angular momentum of a rotating rigid body, 
polarization of a crystal by an electric field, and the mass attached to some springs all fit this definition. 
Other examples: 

5. The simplest example of all is just multiplication by a scalar: A{v) = cv for all v. This applies to 
any vector space and its domain is the entire space. 

6. On the vector space of all real valued functions on a given interval, multiply any function / by 
1 + x^: {Af){x) = (1 + x'^)f{x). The domain of A is the entire space of functions of x. This 
is an infinite dimensional vector space, but no matter. There's nothing special about 1 +x'^, and 
any other function will do to define an operator. 

7. On the vector space of square integrable functions [/ dx\f{x)\^ < oo] on a < x < 6, define 
the operator as multiplication by x. The only distinction to make here is that if the interval is 
infinite, then xf{x) may not itself be square integrable. The domain of this operator in this case is 
therefore not the entire space, but just those functions such that xf{x) is also square-integrable. 
On the same vector space, differentiation is a linear operator: {Af){x) = f'{x). This too has a 
restriction on the domain: It is necessary that /' also exist and be square integrable. 

8. On the vector space of infinitely differentiable functions, the operation of differentiation, d/dx, is 
itself a linear operator. It's certainly linear, and it takes a differentiable function into a differentiable 
function. 

So where are the matrices? This chapter started by saying that I'm going to show you the inside scoop 
on matrices and so far I've failed to produce even one. 

When you describe vectors you can use a basis as a computational tool and manipulate the 
vectors using their components. In the common case of three-dimensional vectors we usually denote 
the basis in one of several ways 

h J, k, or X, y, z, or ei, 62, 63 

and they all mean the same thing. The first form is what you see in the introductory physics texts. The 
second form is one that you encounter in more advanced books, and the third one is more suitable when 
you want to have a compact index notation. It's that third one that I'll use here; it has the advantage 
that it doesn't bias you to believe that you must be working in three spatial dimensions. The index 
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could go beyond 3, and the vectors that you're dealing with may not be the usual geometric arrows. 
(And why does it have to start with one? Maybe I want the indices 0, 1, 2 instead.) These need not 
be perpendicular to each other or even to be unit vectors. 
The way to write a vector v in components is 

V = VxX + Vyy + VzZ, or fiCi + t;2e2 + ^'3e3 = ^ (7-5) 

k 

Once you've chosen a basis, you can find the three numbers that form the components of that 
vector. In a similar way, define the components of an operator, only that will take nine numbers to do 
it (in three dimensions). If you evaluate the effect of an operator on any one of the basis vectors, the 
output is a vector. That's part of the definition of the word operator. This output vector can itself be 
written in terms of this same basis. The defining equation for the components of an operator / is 



3 

/(ei) = ^fkiek 

k=l 



(7.6) 



For each input vector you have the three components of the output vector. Pay careful attention 
to this equation! It is the defining equation for the entire subject of matrix theory, and everything in 
that subject comes from this one innocuous looking equation. (And yes if you're wondering, I wrote 
the indices in the correct order.) 

Why? 

Take an arbitrary input vector for f:u = f{v). Both u and v are vectors, so write them in 
terms of the basis chosen. 

u = ^Ukek = f{v) = /(Xl^i^i) = (7.7) 

k i i 

The last equation is the result of the linearity property, Eq. (7.1), already assumed for /. Now pull the 
sum and the numerical factors Vi out in front of the function, and write it out. It is then clear: 

fiviei +^262) = fiviei) + f{v2e2) = vif{ei) + V2f{e2) 

Now you see where the defining equation for operator components comes in. Eq. (7.7) is 

k i k 

For two vectors to be equal, the corresponding coefficients of ei, 62, etc. must match; their respective 
components must be equal, and this is 

Uk = Yl '"ifkh usually written «fc = XI fki^i (7.8) 

i i 

SO that in the latter form it starts to resemble what you may think of as matrix manipulation. /row,coiumn 
is the conventional way to write the indices, and multiplication is defined so that the following product 
means Eq. (7.8). 

fll fl2 fl3 

/2I /22 /23 I I ^^2 I (7.9) 

/3I /32 /33 







U2 















V2 




{-J 
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fn fn" 

/2I /22 /23 I I ^2 I is Ui= fnVi + fi2V2 + fl^Vs etC. 

/31 /32 /as , 

And this is the reason behind the definition of how to multiply a matrix and a column matrix. The 
order in which the indices appear is the conventional one, and the indices appear in the matrix as they 
do because I chose the order of the indices in a (seemingly) backwards way in Eq. (7.6). 

Components of Rotations 

Apply this to the first example, rotate all vectors in the plane through the angle a. I don't want to 
keep using the same symbol / for every function, so I'll call this function R instead, or better yet Ra- 
Ra{v) is the rotated vector. Pick two perpendicular unit vectors for a basis. You may call them x and 
y, but again I'll call them Ci and 62- Use the definition of components to get 



Ra{e2 




Ra{ei) 



k 

Ra{e2) = -Rfc2efc 

k 



(7.10) 



cos a d 
The rotated ei has two components, so 



Ra{(ii) = Ci COS a + 6*2 sin a = -RuCi + R2ie2 
This determines the first column of the matrix of components, 

= cos a, and R21 = sin a 
Similarly the effect on the other basis vector determines the second column: 

Ra{e2) = 6*2 COS a — 6*1 sin a = R12 e.\ + R22 6*2 
Check: -Ra(ei) ■ Ra{fi2) = 0. 

R12 = — sin a, and R22 = cos a 

The component matrix is then 

{Ra) -- 



(7.11) 



(7.12) 



cos a 
sin a 



— sma 
cos a 



(7.13) 



Components of Inertia 

The definition, Eq. (7.3), and the figure preceding it specify the inertia tensor as the function that 
relates the angular momentum of a rigid body to its angular velocity. 



Use the vector identity, 
then the integral is 



dm r X (a; X r ) = /(cD) 

Ax {B xC) = B{A-C) -C{A-B) 
dm [uj{f- f) — f{uj ■ f)] = I{uj) 



(7.14) 

(7.15) 
(7.16) 



Pick the common rectangular, orthogonal basis and evaluate the components of this function. Equa- 
tion (7.6) says r = xei+ ye2 + -263 so 
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k 

/(ei) = J dm [ei(x^ + + z^) - {xei + ye2 + 2;e3)(x)] 

= hiCi + 12162 + JsiCs 

111 = y dm{y^ + z^), hi = — J dmyx, I^i = — J dmzx 



from which 



This provides the first column of the components, and you get the rest of the components the same 
way. The whole matrix is 

2 _|_ ^2 \ 

dm I -xy x^ + z"^ -yz (7-17) 

—xz —yz 



+ y'^ 



These are the components of the tensor of inertia. The diagonal elements of the matrix may be 
familiar; they are the moments of inertia, + y"^ is the perpendicular distance-squared to the ^-axis, 
so the element /as (= Izz) is the moment of inertia about that axis, / dmr'^. The other components 
are less familiar and are called the products of inertia. This particular matrix is symmetric: Jjj 
That's a special property of the inertia tensor. 

Components of Dumbbell 

Look again at the specific case of two masses rotating about an axis. Do it quantitatively. 



ri X miVi 




The integrals in Eq. (7.17) are simply sums this time, and the sums have just two terms. I'm 
making the approximation that these are point masses. Make the coordinate system match the indicated 
basis, with x right and y up, then z is zero for all terms in the sum, and the rest are 

J dm [y^ + z^) = mir\ cos^ a + m2r| cos^ a 

— j dm xy = —mir\ cos a sin a — m2r\ cos a sin a 
j dm {x^ + z"^) = mir\ svc? a + m2r| sin^ a 



dm (x^ + y"^) = mirf + m2r2 



The matrix is then 



(mirl + m2r|) 



cos^ a 
cos a sin a 




cos a sm a 
sin^ a 



(7.18) 
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Don't count on all such results factoring so nicely. 

In this basis, the angular velocity (2 has just one component, so what is LI 



cos^ a 


— cos a. sin a. 


0^ 






cos a. sin a 


sin^ a 





II 

















-UJ COS a sm a 



{rn\r\ + m2r2) cusi 



sm a 




Translate this into vector form: 



m\r\ + m2r|)a; sinQ;( 



Ci cos a + 6*2 sin a) 



(7.19) 



When a = 90°, then cos a = and the angular momentum points along the y-axis. This is the 
symmetric special case where everything lines up along one axis. Notice that if a = then everything 
vanishes, but then the masses are both on the axis, and they have no angular momentum. In the general 
case as drawn, the vector L points to the upper left, perpendicular to the line between the masses. 

Parallel Axis Theorem 

When you know the tensor of inertia about one origin, you can relate the result to the tensor about a 
different origin. 

The center of mass of an object is 



1 

M 



fdm 



(7.20) 



where M is the total mass. Compare the operator / using an origin at the center of mass to / about 
another origin. 




r — Tr 



I{u ) = dmfx{uxf)= dm [f- f^^ + fcm] x [Q x[f- f^^ + fcm] ) 

= / dm [f — fern] X {Q X [f — rem]) + / dm fern X (cu X -Tcm) + two cross terms 
The two cross terms vanish, problem 7.17. What's left is 

/(cJ) = dm [f- fcm] X (a; X [f- fcm]) + Mrcm x (a; x fcm) 

= /cm(w ) + M fen, X (w X fcm) 
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Put this in words and it says that the tensor of inertia about any point is equal to the tensor of inertia 
about the center of mass plus the tensor of inertia of a point mass M placed at the center of mass. 

As an example, place a disk of mass M and radius R and uniform mass density so that its center 
is at {x,y,z) = (i?, 0, 0) and it is lying in the x-y plane. Compute the components of the inertia 
tensor. First get the components about the center of mass, using Eq. (7.17). 




X 



The integrals such as 



dmxy, 



dmyz 



are zero. For fixed y each positive value of x has a corresponding negative value to make the integral 
add to zero. It is odd in x (or y)\ remember that this is about the center of the disk. Next do the 
integral. 

j dm {x'^ + y'^) = J dmr'^ = j ^^^dAr"^ 
For the element of area, use dA = 2TTr dr and you have 

M I 



M , ^ 3 
/gg = / dr 



TiR'jo 

For the next two diagonal elements, 



ttR^'^'^ 4 2 



MR-" 



In = J dm{y'^ + z'^) = j dmy^ and /22 = j dm{x'^ + z'^) = j dmx"^ 
Because of the symmetry of the disk, these two are equal, also you see that the sum is 



hi +h2 = j dmy"^ + J dmx"^ = Jgg = 1 



MR^ 



(7.23) 



This saves integration, hi = -^22 = MR^/i. 

For the other term in the sum (7.22), you have a point mass at the distance R along the x-axis, 
{x,y,z) = (i?, 0,0). Substitute this point mass into Eq. (7.17) and you have 

'00 

M \ R^ 

R\ 

The total about the origin is the sum of these two calculations. 

1/4 
MR^ (0 5/4 
3/2 
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Why is this called the parallel axis theorem when you're translating a point (the origin) and not an 
axis? Probably because this was originally stated for the moment of inertia alone and not for the whole 
tensor. In that case you have only an axis to deal with. 

Components of the Derivative 

The set of all polynomials in x having degree < 2 forms a vector space. There are three independent 
vectors that I can choose to be 1, x, and x"^ . Differentiation is a linear operator on this space because 
the derivative of a sum is the sum of the derivatives and the derivative of a constant times a function 
is the constant times the derivative of the function. With this basis I'll compute the components of 
d/dx. Start the indexing for the basis from zero instead of one because it will cause less confusion 
between powers and subscripts. 

eo = 1, e 1 = X, 62 = x"^ 
By the definition of the components of an operator — I'll call this one D, 

D{eo) = ^1 = 0, D{ei) = -^x = 1 = eo, D{e2) = ^x^ = 2x = 2ei 

These define the three columns of the matrix. 

1 0\ dx^ 
(D) = 2 check: -j- = 2x\s 

\0 0/ 

There's nothing here about the basis being orthonormal. It isn't. 
7.4 IVIatrix IVIultiplication 

How do you multiply two matrices? There's a rule for doing it, but where does it come from? 
The composition of two functions means you first apply one function then the other, so 

h = fog means h{v) = f{g{v)) (7.24) 

I'm assuming that these are vector-valued functions of a vector variable, but this is the general definition 
of composition anyway. If / and g are linear, does it follow the h is? Yes, just check: 

h{cv) = f{g{cv)) = f{cg{v)) = cf{g{v)), and 

^(Vl +V2) = /(^(^/l +^/2)) = /(^(^/l) +^(t?^)) = /(^(^/l)) +/(^(^?2)) 

What are the components of hi Again, use the definition and plug in. 

^(^i) = ^hkiek = f [gieij) =f{J2 9ji ^j) = 9jif i^j) = 9ji Yl fkj 4 (7.25) 

k 3 j j k 

and now all there is to do is to equate the corresponding coefficients of e/j. 

hki = J23jifkj or more conventionally hki = Yfkj9ji (7-26) 

3 j 

This is in the standard form for matrix multiplication, recalling the subscripts are ordered as frc for 
row-column. 

/ii /i2 /i3 \ / gu gi2 gi3 \ 

/2I /32 /23 5-21 5-32 5-23 ( 7.27) 

/3I /32 /33 / V g31 g32 9'33 / 





hi2 


h 




h32 


h 




h32 


h 
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flu 


hn 


his 






h23 












> h 



The computation of hi2 from Eq. (7.26) is 

fn — frz — fi3\ All |i2 gi3 

/2I /22 /23 I I fi'Sl ''22 923 
/3I /32 /ss / \ fi'31 S32 933 , 

12 = /llfi'12 + /l25'22 + /l3fl'32 

Matrix multiplication is just the component representation of the composition of two functions, 
Eq. (7.26), and there's nothing here that restricts this to three dimensions. In Eq. (7.25) I may have 
made it look too easy. If you try to reproduce this without looking, the odds are that you will not 
get the indices to match up as nicely as you see there. Remember: When an index is summed it is a 
dummy, and you are free to relabel it as anything you want. You can use this fact to make the indices 
come out neatly. 

Composition of Rotations 

In the first example, rotating vectors in the plane, the operator that rotates every vector by the angle 
a has components 

(i?«) = f"°^" -^^""'1 (7.28) 
^ ' \sma cos a J ^ ' 

What happens if you do two such transformations, one by a. and one by /3? The result better be a total 
rotation by a + /3. One function, is followed by the second function and the composition is 

Ra+i3 = RaRp 

This is mirrored in the components of these operators, so the matrices must obey the same equation. 

cos{a + f3) — sin(a + /3)\ _ f cos a — sina\ / cos/3 — sin/3 
sin(a + /3) cos(a + /3) ) ysina cos a ) ysin/3 cos/3 

Multiply the matrices on the right to get 



cos a cos /3 — sin a sin /3 — cos a sin /3 — sin a cos /3 
sin a cos /3 + cos ex sin (3 cos a cos /3 — sin a sin /3 



(7.29) 



The respective components must agree, so this gives an immediate derivation of the formulas for the 
sine and cosine of the sum of two angles. Cf. Eq. (3.8) 

7.5 Inverses 

The simplest operator is the one that does nothing. f{v) = v for all values of the vector v. This implies 
that f{ei) = 6*1 and similarly for all the other elements of the basis, so the matrix of its components 
is diagonal. The 2x2 matrix is explicitly the identity matrix 

(I) = ( \ ?| or in index notation Sji = l\ (i^^ , (7.30) 
^ ^ \0 1 J 1 (if 2 7^ j) ^ ' 

and the index notation is completely general, not depending on whether you're dealing with two dimen- 
sions or many more. Unfortunately the words "inertia" and "identity" both start with the letter "I" and 
this symbol is used for both operators. Live with it. The 5 symbol in this equation is the Kronecker 
delta — very handy. 

The inverse of an operator is defined in terms of Eq. (7.24), the composition of functions. If the 
composition of two functions takes you to the identity operator, one function is said to be the inverse of 
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the other. This is no different from the way you look at ordinary real valued functions. The exponential 
and the logarithm are inverse to each other because* 



In(e^) 



X 



for all X. 



For the rotation operator, Eq. (7.10), the inverse is obviously going to be rotation by the same angle 
in the opposite direction. 

RaR—a — I 

Because the matrix components of these operators mirror the original operators, this equation must 
also hold for the corresponding components, as in Eqs. (7.27) and (7.29). Set j3 = —a in (7.29) and 
you get the identity matrix. 

In an equation such as Eq. (7.7), or its component version Eqs. (7.8) or (7.9), if you want to 
solve for the vector u, you are asking for the inverse of the function /. 

u = fiv) implies v = f^^{u) 

The translation of these equations into components is Eq. (7.9) 



Ml 
U2 



which implies 









CO 


/22 


-/l2 


-/21 


/ll 



Ml 
U2 



(7.31) 



The verification that these are the components of the inverse is no more than simply multiplying the 
two matrices and seeing that you get the identity matrix. 

7.6 Rotations, 3-d 

In three dimensions there are of course more basis vectors to rotate. Start by rotating vectors about 
the axes and it is nothing more than the two-dimensional problem of Eq. (7.10) done three times. You 
do have to be careful about signs, but not much more — as long as you draw careful pictures! 

A' 




\ 



xj) 





The basis vectors are drawn in the three pictures: e\ = x, 62 = y, 63 = z. 

In the first sketch, rotate vectors by the angle a about the x-axis. In the second case, rotate by 
the angle /3 about the y-axis, and in the third case, rotate by the angle 7 about the z-axis. In the first 
case, the ei is left alone. The 62 picks up a little positive 63, and the 63 picks up a little negative 62- 



Raei (6*1) = 6*1, Raei (^2) = 6*2 COS + 63 sin a, Rae^ (^s) = 63 COS a — 6*2 sin a 



(7.32) 



* The reverse, e'"^^ works just for positive x, unless you recall that the logarithm of a negative 
number is complex. Then it works there too. This sort of question doesn't occur with finite dimensional 
matrices. 
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Here the notation represents the function prescribing a rotation by 6 about the axis pointing along 

6. These equations are the same as Eqs. (7.11) and (7.12). 

The corresponding equations for the other two rotations are now easy to write down: 

Rpg^ (ei) = ei cos (3-63 sin (3, Rpg^ (62) = 62, Rpg^ (es) = el sin /3 + 63 cos (3 (7.33) 
-^763 (^1) = ^1 C0S7 + 62 sin7, R^g^ (62) = -ei sin7 + 62 C0S7, R^^^ (cs) = 63 (7.34) 

From these vector equations you immediate read the columns of the matrices of the components of the 
operators as in Eq. (7.6). 

{Raei) {R13S2) (^763) 

10 ^ \ / sin /3 \ / cos 7 — sin 7 \ 

cos a — sina , 1 ' sin 7 cos 7 (7.35) 

sin a cos a j y — sin /3 cos (3 j \ 1 / 

As a check on the algebra, did you see if the rotated basis vectors from any of the three sets of equations 
(7.32)-(7.34) are still orthogonal sets? 

Do these rotation operations commute? No. Try the case of two 90° rotations to see. Rotate 
by this angle about the x-axis then by the same angle about the y-axis. 




{Re,. /2) {Re,. 12) = 1 -1 = -1 (7.36) 




In the reverse order, for which the rotation about the y-axis is done first, these are 

/l \ / l\ /O l\ 
{Re,. 12) {Re,. 12) = -1 1 = 1 (7.37) 

\0 1 / \-l 0/ \0 1 0/ 

Translate these operations into the movement of a physical object. Take the same x-y-z coor- 
dinate system as in this section, with x pointing toward you, y to your right and z up. Pick up a book 
with the cover toward you so that you can read it. Now do the operation ^^^^7^/2 s° ^^^^ cover 
still faces you but the top is to your left. Next do Rg^^^2 ^i^d the book is face down with the top still 
to your left. See problem 7.57 for and algebraic version of this. 

Start over with the cover toward you as before and do Rg^-^f2 so that the top is toward you 
and the face is down. Now do the other operation Rg-^T^/2 ^""^ the top is toward you with the cover 
facing right — a different result. Do these physical results agree with the matrix products of the last 
two equations? For example, what happens to the vector sticking out of the cover, initially the column 
matrix (1 0)? This is something that you cannot simply read. You have to do the experiment for 
yourself. 

7.7 Areas, Volumes, Determinants 

In the two-dimensional example of arrows in the plane, look what happens to areas when an operator 
acts. The unit square with corners at the origin and (0,1), (1,1), 1,0) gets distorted into a paral- 
lelogram. The arrows from the origin to every point in the square become arrows that fill out the 
parallelogram. 
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What is the area of this parallelogram? 

ril ask a more general question. (It isn't really, but it looks like it.) Start with any region in the 
plane, and say it has area Ai. The operator takes all the vectors ending in this area into some new area 
of a size A2, probably different from the original. What is the ratio of the new area to the old one? 
A2/A1. How much does this transformation stretch or squeeze the area? What isn't instantly obvious 
is that this ratio of areas depends on the operator alone, and not on how you chose the initial region 
to be transformed. If you accept this for the moment, then you see that the question in the previous 
paragraph, which started with the unit square and asked for the area into which it transformed, is the 
same question as finding the ratio of the two more general areas. (Or the ratio of two volumes in three 
dimensions.) See the end of the next section for a proof. 

This ratio is called the determinant of the operator. 

The first example is the simplest. Rotations in the plane, Ra- Because rotations leave area 
unchanged, this determinant is one. For almost any other example you have to do some work. Use the 
component form to do the computation. The basis vector ei is transformed into the vector /n ei+/2i 62 
with a similar expression for the image of 62- You can use the cross product to compute the area of 
the parallelogram that these define. For another way, see problem 7.3. This is 

(/iici + /2ie2) X (/laei + /22e2) = (/11/22 - /2i/i2)e3 (7.38) 

The product in parentheses is the determinant of the transformation. 

det(/) = /11/22 - /21/12 (7.39) 

What if I had picked a different basis, maybe even one that isn't orthonormal? From the definition of 
the determinant it is a property of the operator and not of the particular basis and components you 
use to describe it, so you must get the same answer. But will the answer be the same simple formula 
(7.39) if I pick a different basis? Now that's a legitimate question. The answer is yes, and that fact 
will come out of the general computation of the determinant in a moment. [What is the determinant 
of Eq. (7.13)?] 

det > 




The determinant can be either positive or negative. That tells you more than simply how the 
transformation alters the area; it tells you whether it changes the orientation of the area. If you place 
a counterclockwise loop in the original area, does it remain counterclockwise in the image or is it 
reversed? In three dimensions, the corresponding plus or minus sign for the determinant says that 
you're changing from a right-handed set of vectors to a left-handed one. What does that mean? Make 
an x-y-z coordinate system out of the thumb, index finger, and middle finger of your right hand. Now 
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do it with your left hand. You cannot move one of these and put it on top of the other (unless you 
have very unusual joints). One is a mirror image of the other. 

The equation (7.39) is a special case of a rule that you've probably encountered elsewhere. You 
compute the determinant of a square array of numbers by some means such as expansion in minors or 
Gauss reduction. Here I've defined the determinant geometrically, and it has no obvious relation the 
traditional numeric definition. They are the same, and the reason for that comes by looking at how the 
area (or volume) of a parallelogram depends on the vectors that make up its sides. The derivation is 
slightly involved, but no one step in it is hard. Along the way you will encounter a new and important 
function: A. 

Start with the basis e\, 6*2 and call the output of the transformation V\ = /(ei) and V2 = /(e2)- 
The final area is a function of these last two vectors, call it A{vi,V2), and this function has two key 
properties: 

A({;,{7)=0, and A{vi,av2 + /Svs) = aA{vi,V2) + l3A{vi,V3) (7.40) 

That the area vanishes if the two sides are the same is obvious. That the area is a linear function of 
the vectors forming the two sides is not so obvious. (It is linear in both arguments.) Part of the proof 
of linearity is easy: 

A{vi,av2) = aA{vi,V2) 

simply says that if one side of the parallelogram remains fixed and the other changes by some factor, 
then the area changes by that same factor. For the other part, A({7i, + t's) , start with a picture and 
see if the area that this function represents is the same as the sum of the two areas made from the 
vectors Vi&iV2 and Vi&iV^. 

Vi&iV2 form the area OCBA. viiiv-s form the area OCED. 



HDF ^ JEG 
HDO ^ JEC 

so 

area HJGF = area DEGF = area OCBA 
area OCJH = area OCED 

add these equations: 
area OCGF = area OCBA + area OCED 

The last line is the statement that sum of the areas of the two parallelograms is the area of the 
parallelogram formed using the sum of the two vectors: 

A(yi,V2 + V3) = A(yi,V2) +A{vi,V3) 

This sort of function A, generalized to three dimensions, is characterized by 

(1) A{avi+ f3vl,V2,V3) = aA(vi,V2,V3) + /3A(vl,V2,V3) 

(2) A(t;i,t;i,t;3) =0 (7.41) 

It is linear in each variable, and it vanishes if any two arguments are equal. I've written it for three 
dimensions, but in N dimensions you have the same equations with arguments, and these properties 
hold for all of them. 
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Theorem: Up to an overall constant factor, this function is unique. 



An important result is that these assumptions imply the function is antisymmetric in any two 
arguments. Proof: 

A{vi + V2, Vl + V2,V3) = = A{vi, Vl,V3) + A(vi, ^2, ^^s) + A({72, ^^1, ^^s) + HV2, V2,V3} 

This is just the linearity property. Now the left side, and the and 4*^ terms on the right, are zero 
because two arguments are equal. The rest is 

A (vi , ^2 , -ua) + A {V2 ,Vi,V3) =0 (7.42) 

and this says that interchanging two arguments of A changes the sign. (The reverse is true also. Assume 
antisymmetry and deduce that it vanishes if two arguments are equal.) 

I said that this function is unique up to a factor. Suppose that there are two of them: A and A'. 
Now show for some constant a, that A — aA' is identically zero. To do this, take three independent 
vectors and evaluate the number A'({7a, ^^b, '^c) There is some set of v's for which this is non-zero, 
otherwise A' is identically zero and that's not much fun. Now consider 

a = — , ^ ^ ^\ and define Aq = A — aA 
A'[v3,Vb,Vc) 

This function Aq is zero for the special argument: (^4, i?b) ^c). and now 1'!! show why it is zero for all 
arguments. That means that it is the zero function, and says that the two functions A and A' are 
proportional. 

The vectors [Vg, Vb, Vq) are independent and there are three of them (in three dimensions). They 
are a basis. You can write any vector as a linear combination of these. E.g. 

Vl = Ava + Bv[j and V2 = Cv^ + Dv^ and 

Put these (and let's say Vc) into Aq. 

Ao(^'l,^2,fc) = ACAo{Va,Va,Vc) + ADAo{Va,Vh,Vc) + BCAo{Vb,Va,Vc) + BDAo{Vh,Vb,Vc) 

All these terms are zero. Any argument that you put into Aq is a linear combination of V3, v^, and Vc, 
and that means that this demonstration extends to any set of vectors, which in turn means that Aq 
vanishes for any arguments. It is identically zero and that implies A and A' are, up to a constant overall 
factor, the same. 

In N dimensions, a scalar-valued function of N vector variables, 
linear in each argument and antisymmetric under interchanging any 
pairs of arguments, is unique up to a factor. 

I've characterized this volume function A by two simple properties, and surprisingly enough this is 
all you need to compute it in terms of the components of the operator! With just this much information 
you can compute the determinant of a transformation. 

Recall: Vi has for its components the first column of the matrix for the components of /, and 
V2 forms the second column. Adding any multiple of one vector to another leaves the volume alone. 
This is 

A{vi,V2 + oivi,V3) = A{vi,V2,V3) + aA{vi,vi,V3) (7.43) 
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and the last term is zero. Translate this into components. Use the common notation for a determinant, 
a square array with vertical bars, but forget that you know how to compute this symbol! I'm going 
to use it simply as a notation by keep track of vector manipulations. The numerical value will come 
out at the end as the computed value of a volume. Vi = f{ei) = J2j fji^j' then k(^Vi,V2-,v^) = 



fll fl2 + /l3 
/2I /22 + «/21 /23 
/3I /32 + tt/31 /33 



fll fl2 /l3 
/2I /22 /23 
/3I /32 /33 



+ a 



fll fll fl3 
/2I /2I /23 
/3I /3I /33 



fll fl2 fl3 
/2I /22 /23 
/3I /32 /33 



To evaluate this object, simply choose a to make the element /12 + = 0. 

operation, adding a multiple of the first column to the third, making the element /13 
operation doesn't change the original value of A(tTi, ^^3) ■ 



Then repeat the 
f/3/ii = 0. This 



A(vi,V2 + avi,V3 + I3vi) 











fll 

/2I /22 + a/21 /23 + /3/2I 

/31 /32 + a/31 /as + /3/31 



fll 
/2I /22 /23 
/3I /32 /33 



Repeat the process to eliminate /23, adding 71T2 to the third argument, where 7 = — /, 



23/ /22- 



fll 

/2I /22 /23 
/3I /32 /33 



/ii ^ 

/2I /22 /23 + 7/22 
/3I /32 /33 + 7/32 



11 



/: 

/2I /2 







22 



/3I /32 /ss 



(7.44) 



Written in the last form, as a triangular array, the final result for the determinant does not depend 
on the elements /21, /31, /32. They may as well be zero. Why? Just do the same sort of column 
operations, but working toward the left. Eliminate /31 and f^2 adding a constant times the third 
column to the first and second columns. Then eliminate /21 by using the second column. You don't 
actually have to do this, you just have to recognize that it can be done so that you can ignore the lower 
triangular part of the array. 

Translate this back to the original vectors and A is unchanged: 

A(wi,t72,t?3) = A(/iiei,/22e2,/33e3) = /11/22/33 ^(ei, 62, 63) 



The volume of the original box is A(ei,e2,e3), so the quotient of the new volume to the old 



one IS 



det = /11/22/33 



(7.45) 



The fact that A is unique up to a constant factor doesn't matter. Do you want to measure volume 
in cubic feet, cubic centimeters, or cubic light-years? This algorithm is called Gauss elimination. 
It's development started with the geometry and used vector manipulations to recover what you may 
recognize from elsewhere as the traditional computed value of the determinant. 

Did I leave anything out in this computation of the determinant? Yes, one point. What if 
in Eq. (7.44) the number /22 = 0? You can't divide by it then. You can however interchange any 
two arguments of A, causing simply a sign change. If this contingency occurs then you need only 
interchange the two columns to get a component of zero where you want it. Just keep count of such 
switches whenever they occur. 

Trace 

There's a property closely related to the determinant of an operator. It's called the trace. If you have 
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an operator /, then consider the determinant of M = I + ef, where / is the identity. This combination 
is very close to the identity if e is small enough, so its determinant is very close to one. How close? 
The first order in e is called the trace of /, or more formally 



Tr(/) = ^ det (/ + e/) 
Express this in components for a two dimensional case, and 



(7.46) 



e=0 



(/) 



^) ^ det(/ + e/)=det(^^ + '^ ^'^^^^ = [1 + ea){l + ed) - e'bc (7.47) 



The first order coefficient of e is a + d, the sum of the diagonal elements of the matrix. This is the 
form of the result in any dimension, and the proof involves carefully looking at the method of Gauss 
elimination for the determinant, remembering at every step that you're looking for only the first order 
term in e. See problem 7.53. 

7.8 Matrices as Operators 

There's an important example of a vector space that I've avoided mentioning up to now. Example 5 
in section 6.3 is the set of n-tuples of numbers: (ai, 02, . . . , an). I can turn this on its side, call it a 
column matrix, and it forms a perfectly good vector space. The functions (operators) on this vector 
space are the matrices themselves. 

When you have a system of linear equations, you can translate this into the language of vectors. 



ax + by = e and cx + dy = f 



a b\ f x\ _ f e 
c d \y -\f 



Solving for x and y is inverting a matrix. 

There's an aspect of this that may strike you as odd. This matrix is an operator on the vector 
space of column matrices. What are the components of this operator? What? Isn't the matrix a set of 
components already? That depends on your choice of basis. Take an example 

^ = ( o ^ ) with basis = ( M , 62 ^ 



^ 3 4 y " V y ' ^ VI 

Compute the components as usual. 

Me,= [\ ^Vi^ = fo 1 = lei + 362 



3 4 y V / V 3^ 

This says that the first column of the components of M in this basis are ( 1 ) . What else would you 



expect? Now select a different basis. 

ei 

Again compute the component. 











6*2 = 1 
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The components of M in this basis are 







It doesn't look at all the same, but it represents the same operator. Does this matrix have the same 
determinant, using Eq. (7.39)? 

Determinant of Composition 

If you do one linear transformation followed by another one, that is the composition of the two functions, 
each operator will then have its own determinant. What is the determinant of the composition? Let 
the operators be F and G. One of them changes areas by a scale factor det(i^) and the other ratio of 
areas is det(G'). If you use the composition of the two functions, FG or GF, the overall ratio of areas 
from the start to the finish will be the same: 



det(FG) = det(F) ■ det(G') = det(G') ■ det(F) = det(GF) 



(7.48) 



Recall that the determinant measures the ratio of areas for any input area, not just a square; it can be 
a parallelogram. The overall ratio of the product of the individual ratios, det(-F) det(G'). The product 
of these two numbers is the total ratio of a new area to the original area and it is independent of the 
order of F and G, so the determinant of the composition of the functions is also independent of order. 

Now what about the statement that the definition of the determinant doesn't depend on the 
original area that you start with. To show this takes a couple of steps. First, start with a square that's 
not at the origin. You can always picture it as a piece of a square that is at the origin. The shaded 
square that is l/l6 the area of the big square goes over to a parallelogram that's l/l6 the area of the 
big parallelogram. Same ratio. 




An arbitrary shape can be divided into a lot of squares. That's how you do an integral. The 
image of the whole area is distorted, but it retains the fact that a square that was inside the original 
area will become a parallelogram that is inside the new area. In the limit as the number of squares goes 
to infinity you still maintain the same ratio of areas as for the single original square. 




7.9 Eigenvalues and Eigenvectors 

There is a particularly important basis for an operator, the basis in which the components form a 
diagonal matrix. Such a basis almost always* exists, and it's easy to see from the definition as usual 
just what this basis must be. 

N 

/(^») = X] 
k=l 



* See section 7.12. 
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To be diagonal simply nneans that /^j = for all i ^ k, and that in turn means that all but one term 
in the sum disappears. This defining equation reduces to 

/(cj) = fiiCi (with no sum this time) (7.49) 

This is called an eigenvalue equation. It says that for any one of these special vectors, the operator / 
on it returns a scalar multiple of that same vector. These multiples are called the eigenvalues, and the 
corresponding vectors are called the eigenvectors. The eigenvalues are then the diagonal elements of 
the matrix in this basis. 

The inertia tensor is the function that relates the angular momentum of a rigid body to its 
angular velocity. The axis of rotation is defined by those points in the rotating body that aren't moving, 
and the vector Q lies along that line. The angular momentum is computed from Eq. (7.3) and when 
you've done all those vector products and integrals you can't really expect the angular momentum to 
line up with Q unless there is some exceptional reason for it. As the body rotates around the Q axis, 
L will be carried with it, making L rotate about the direction of Q. The vector L is time-dependent 
and that implies there will be a torque necessary to keep it going, r = dL/dt. Because L is rotating 
with frequency uj, this rotating torque will be felt as a vibration at this rotation frequency. If however 
the angular momentum happens to be parallel to the angular velocity, the angular momentum will not 
be changing; dL/dt = and the torque r = dL/dt will be zero, implying that the vibrations will be 
absent. Have you ever taken your car in for servicing and asked the mechanic to make the angular 
momentum and the angular velocity vectors of the wheels parallel? It's called wheel-alignment. 

How do you compute these eigenvectors? Just move everything to the left side of the preceding 
equation. 

/(ei) - fa Ci = 0, or (/ - fiil)ei = 
/ is the identity operator, output equals input. This notation is cumbersome. I'll change it. 

f{v) = Xv ^ (/ - XI)v = (7.50) 

A is the eigenvalue and v is the eigenvector. This operator (/ — XI) takes some non-zero vector into 
the zero vector. In two dimensions then it will squeeze an area down to a line or a point. In three 
dimensions it will squeeze a volume down to an area (or a line or a point). In any case the ratio of the 
final area (or volume) to the initial area (or volume) is zero. That says the determinant is zero, and 
that's the key to computing the eigenvectors. Figure out which A's will make this determinant vanish. 

Look back at section 4.9 and you'll see that the analysis there closely parallels what I'm doing 
here. In that case I didn't use the language of matrices or operators, but was asking about the possible 
solutions of two simultaneous linear equations. 



ax + by = and cx + dy = 0, or 



a b\ f x \ _ f 
c d [y - I 



The explicit algebra there led to the conclusion that there can be a non-zero solution {x,y) to the two 
equations only if the determinant of the coefficients vanishes, ad — be = 0, and that's the same thing 
that I'm looking for here: a non-zero vector solution to Eq. (7.50). 

Write the problem in terms of components, and of course you aren't yet in the basis where the 
matrix is diagonal. If you were, you're already done. The defining equation is f{v) = Xv, and in 
components this reads 

^fkiVi = >^Vk, or 

i V /si /32 











f Vi 


J23 


II 








/aa / 








{V3 
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Here I arbitrarily wrote the equation for three dimensions. That will change with the problem, 
everything on the left side and insert the components of the identity, the unit matrix. 

/ll fl2 fl3 
/2I /22 /23 

/31 /32 /as 





Put 



(7.51) 



The one way that this has a non-zero solution for the vector v is for the determinant of the whole matrix 
on the left-hand side to be zero. This equation is called the characteristic equation of the matrix, and in 
the example here that's a cubic equation in A. If it has all distinct roots, no double roots, then you're 
guaranteed that this procedure will work, and you will be able to find a basis in which the components 
form a diagonal matrix. If this equation has a multiple root then there is no guarantee. It may work, but 
it may not; you have to look closer. See section 7.12. If the operator has certain symmetry properties 
then it's guaranteed to work. For example, the symmetry property found in problem 7.16 is enough 
to insure that you can find a basis in which the matrix for the inertia tensor is diagonal. It is even an 
orthogonal basis in that case. 

Example of Eigenvectors 

To keep the algebra to a minimum, 
example: 



'II work in two dimensions and will specify an arbitrary but simple 



/(ei) = 2ei -1-62, /(e2) = 2e2 -I- Ci with components 



The eigenvalue equation is, in component form 



2 1 
1 2 



or 



2 1 
1 2 



M 



1 
1 



(7.52) 



(7.53) 



The condition that there be a non-zero solution to this is 



det 



2 1 
1 2 



A 



1 
1 



= (2- A)=^ 



1 



The solutions to this quadratic are A = 1, 3. For these values then, the apparently two equation for 
the two unknowns Vi and V2 are really one equation. The other is not independent. Solve this single 
equation in each case. Take the first of the two linear equations for Vi and V2 as defined by Eq. (7.53). 



A = 1 implies 
The two new basis vectors are then 



2Vi +V2 = Xvi 

V2 = — fi, A = 3 implies 



V2 = Vi 



(ei - 62) 



and 



(ei + 62) 



(7.54) 



and in this basis the matrix of components is the diagonal matrix of eigenvalues. 



1 
3 



If you like to keep your basis vectors normalized, you may prefer to say that the new basis is (ci — e2)/\/2 
and (ei + e2)/V^- The eigenvalues are the same, so the new matrix is the same. 



7 — Operators and Matrices 



22 



Example: Coupled Oscillators 

Another example drawn from physics: Two masses are connected to a set of springs and fastened 
between two rigid walls. This is a problem that appeared in chapter 4, Eq. (4.45). 

m.id'^xi/dt'^ = —kiXi — k3{xi — X2), and m2(P'X2/dt^ = —k2X2 — k^{x2 — Xi) 

The exponential form of the solution was 

xi{t)=Ae''^\ X2{t) = Be^^ 

The algebraic equations that you get by substituting these into the differential equations are a 
pair of linear equations for A and B, Eq. (4.47). In matrix form these equations are, after rearranging 
some minus signs, 



ki + ^3 -/C3 

-/ca k2 + k3 



mi 



1712 

You can make it look more like the previous example with some further arrangement 



ki + fcs -fcs 
-/ca k2 + /ca 



• UJ 



2 f mi 

777.2 



The matrix on the left side maps the column matrix to zero. That can happen only if the matrix has 
zero determinant (or the column matrix is zero). If you write out the determinant of this 2x2 matrix 
you have a quadratic equation in cu"^. It's simple but messy, so rather than looking first at the general 
case, look at a special case with more symmetry. Take 777i = 7772 = tti and ki = /c2- 



det 



ki + A'-a -/ca 
-ks ki + ka 



2 / 1 



= (/ci + /ca - mw'^Y ~ ^3 



This is now so simple that you don't even need the quadratic formula; it factors directly. 

[ki + kz- mu^ - kz) [ki + /ca - mou^ + ks) =0 
The only way that the product of two numbers is zero is if one of the numbers is zero, so either 

ki — muj'^ = or ki + 2ks — muP' = 
This determines two possible frequencies of oscillation. 



and UJ2 



ki + 2A;a 

777 



You're not done yet; these are just the eigenvalues. You still have to find the eigenvectors and then go 
back to apply them to the original problem. This \s F = ma after all. Look back to section 4.10 for 
the development of the solutions. 
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7.10 Change of Basis 

In many problems in physics and mathematics, the correct choice of basis can enormously simplify a 
problem. Sometimes the obvious choice of a basis turns out in the end not to be the best choice, and 
you then face the question: Do you start over with a new basis, or can you use the work that you've 
already done to transform everything into the new basis? 

For linear transformations, this becomes the problem of computing the components of an operator 
in a new basis in terms of its components in the old basis. 

First: Review how to do this for vector components, something that ought to be easy to do. The 
equation (7.5) defines the components with respect to a basis, any basis. If I have a second proposed 
basis, then by the definition of the word basis, every vector in that second basis can be written as a 
linear combination of the vectors in the first basis. I'll call the vectors in the first basis, Cj and those in 
the second basis e'j, for example in the plane you could have 

ei=x, 62 = y, and e[ = 2x + 0.5y, e2 = 0.5x + 2y (7.55) 

Each vector e'- is a linear combination* of the original basis vectors: 

e'i = S{ei) = Y,SJ^eJ (7.56) 

3 

This follows the standard notation of Eq. (7.6); you have to put the indices in this order in order to 
make the notation come out right in the end. One vector expressed in two different bases is still one 
vector, so 

i i 

and I'm using the fairly standard notation of f ^ for the i^^ component of the vector y with respect to 
the second basis. Now insert the relation between the bases from the preceding equation (7.56). 

i j j 

and this used the standard trick of changing the last dummy label of summation from i to j so that it 
is easy to compare the components. 

"^^Sjiv'^ = Vj or in matrix notation {S){v') = (t>), =^ {v') = {SY^{v) 

i 

Similarity Transformations 

Now use the definition of the components of an operator to get the components in the new basis. 

3 

f[Y. ^ji^j) = ^j^fi^j) = Yl fkA = Y f'ji Y ^kj^k 

j j j k j k 

* There are two possible conventions here. You can write e'- in terms of the Cj, calling the 
coefficients Sji, or you can do the reverse and call those components Sji. [e^ = S{e'-)] Naturally, both 
conventions are in common use. The reverse convention will interchange the roles of the matrices S 
and S^^ in what follows. 
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The final equation comes from the preceding line. The coefficients of must agree on the two sides 
of the equation. 

j j 

Now rearrange this in order to place the indices in their conventional row,coiumn order. 

J J 57^ 

f Sii Si2\ f f'li fl2\^( /ll fl2\ ( Sii Si2\ 
V S2I S22 J V /2I /22 / \ /2I /22 / V "^21 'S'22 / 

In turn, this matrix equation is usually written in terms of the inverse matrix of S, 

ism = {f){S) is (/') = {sr\f){S) (7.58) 

and this is called a similarity transformation. For the example Eq. (7.55) this is 

e[ = 2x + 0.5y = SuCi + 5*2162 
which determines the first column of the matrix (S), then e'2 determines the second column. 

2 0.5\ ., ,r,.-i_ 1 / 2 -0.5 



(^)=U-5 2 j 3.75 V -0.5 2 

Eigenvectors 

In defining eigenvalues and eigenvectors I pointed out the utility of having a basis in which the com- 
ponents of an operator form a diagonal matrix. Finding the non-zero solutions to Eq. (7.50) is then 
the way to find the basis in which this holds. Now I've spent time showing that you can find a matrix 
in a new basis by using a similarity transformation. Is there a relationship between these two subjects? 
Another way to ask the question: I've solved the problem to find all the eigenvectors and eigenvalues, 
so what is the similarity transformation that accomplishes the change of basis (and why is it necessary 
to know it if I already know that the transformed, diagonal matrix is just the set of eigenvalues, and I 
already know them)? 

For the last question, the simplest answer is that you don't need to know the explicit transforma- 
tion once you already know the answer. It is however useful to know that it exists and how to construct 
it. If it exists — I'll come back to that presently. Certain manipulations are more easily done in terms 
of similarity transformations, so you ought to know how they are constructed, especially because almost 
all the work in constructing them is done when you've found the eigenvectors. 

The equation (7.57) tells you the answer. Suppose that you want the transformed matrix to be 
diagonal. That means that = and = 0. Write out the first column of the product on the 
right. 

/ll /12 \ / 'S'li 5*12 \ , f /ll /12 \ / 'S'li 



/2I /22 / \ 'S'21 S'22 / \ /2I /22 / V 'S'21 

This equals the first column on the left of the same equation 



11 



S'li 
S21 
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This is the eigenvector equation that you've supposedly already solved. The first column of the com- 
ponent matrix of the similarity transformation is simply the set of components of the first eigenvector. 
When you write out the second column of Eq. (7.57) you'll see that it's the defining equation for the 
second eigenvector. You already know these, so you can immediately write down the matrix for the 
similarity transformation. 

For the example Eq. (7.52) the eigenvectors are given in Eq. (7.54). In components these are 

e 1 ^ ( _\ J , and ^2^(1)' implying 5 = f J 



The inverse to this matrix is 



s 



-1 



1(1 -1 

2 U 1 



You should verify that S ^MS is diagonal. 



7.11 Summation Convention 

In all the manipulation of components of vectors and components of operators you have to do a lot of 
sums. There are so many sums over indices that a convention* was invented (by Einstein) to simplify 
the notation. 

A repeated index in a term is summed. 

Eq. (7.6) becomes /(e^) = fkA. 
Eq. (7.8) becomes = fki'^i- 
Eq. (7.26) becomes hf^^ = fkjQji- 
IM = M becomes ^yM^-fc = M^fc. 

What if there are three identical indices in the same term? Then you made a mistake; that can't 
happen. What about Eq. (7.49)? That has three indices. Yes, and there I explicitly said that there is 
no sum. This sort of rare case you have to handle as an exception. 

7.12 Can you Diagonalize a Matrix? 

At the beginning of section 7.9 I said that the basis in which the components of an operator form a 
diagonal matrix "almost always exists." There's a technical sense in which this is precisely true, but 
that's not what you need to know in order to manipulate matrices; the theorem that you need to have 
is that every matrix is the limit of a sequence of diagonalizable matrices. If you encounter a matrix that 
cannot be diagonalized, then you can approximate it as closely as you want by a matrix that can be 
diagonalized, do your calculations, and finally take a limit. You already did this if you did problem 4.11, 
but in that chapter it didn't look anything like a problem involving matrices, much less diagonalization 
of matrices. Yet it is the same. 
Take the matrix 

'1 2' 
1 

You can't diagonalize this. If you try the standard procedure, here is what happens: 



,0 ly V^2/ \V2) V 1 - 

The resulting equations you get for A = 1 are 

Qvi + 2V2 = Q and = 



There is a modification of this convention that appears in chapter 12, section 12.5 
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This provides only one eigenvector, a nnultiple of ( M . You need two for a basis. 



^0^ 

Change this matrix in any convenient way to make the two roots of the characteristic equation 
different from each other. For example, 



M. 



e 



1 + 6 2 
1 



The eigenvalue equation is now 

(H-e-A)(l- A) = 
and the resulting equations for the eigenvectors are 

A = 1 : e^;i + 2^;2 = 0, = A = 1 + e : 0^;i + 2^2 = 0, e^;2 = 

Now you have two distinct eigenvectors, 

X = 1:{_1/,), and A = l + e:(j 

You see what happens to these vectors as e ^ 0. 
Differential Equations at Critical 

Problem 4.11 was to solve the damped harmonic oscillator for the critical case that — 4km = 0. 

m-r-75- = —kx — b-r- (7.59) 
at"' at 

Write this as a pair of equations, using the velocity as an independent variable. 

dx , dvx k b 

-T7 =Vx and -7- = X Vx 

dt dt m m 

In matrix form, this is a matrix differential equation. 

d f x\ ( \ \ ( X 



dt\VxJ \-k/m -b/m J \V; 



->x 



This is a linear, constant-coefficient differential equation, but now the constant coefficients are matrices. 
Don't let that slow you down. The reason that an exponential form of solution works is that the 
derivative of an exponential is an exponential. Assume such a solution here. 

When you divide the equation by e"*, you're left with an eigenvector equation where the eigenvalue 
is a. As usual, to get a non-zero solution set the determinant of the coefficients to zero and the 
characteristic equation is 

det I ^, , / I = aia + b/m) + k/m = 

\—k/m —b/m — a J ^ 111 
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with familiar roots 



a= ( — 6 ± \I\P' ~ 4km) / 2m 



If the two roots are equal you may not have distinct eigenvectors, and in this case you do not. No 
matter, you can solve any such problem for the case that 6^ — Akm ^ and then take the limit as this 
approaches zero. 

The eigenvectors come from the either one of the two equations represented by Eq. (7.60). Pick 



the simpler one, aA = B. The column matrix 



Pick the initial conditions that a;(0) = and Vx{0) = Vq. You must choose some initial conditions in 
order to apply this technique. In matrix terminology this is 



These are two equations for the two unknowns 

A+ + A_=0, a+A+ + a-A. =vo, 



so 



A, 



a+ — a_ 



A. 



-A, 



X 
Vx 



a+ — a- 



If you now take the limit as 6^ — )• Akm, or equivalently as «_ —5- a+, this expression is just the definition 
of a derivative. 



a 



_b_ 
2m 



(7.61) 



7.13 Eigenvalues and Google 

The motivating idea behind the search engine Google is that you want the first items returned by a 
search to be the most important items. How do you do this? How do you program a computer to 
decide which web sites are the most important? 

A simple idea is to count the number of sites that contain a link to a given site, and the site that 
is linked to the most is then the most important site. This has the drawback that all links are treated 
as equal. If your site is referenced from the home page of Al Einstein, it counts no more than if it's 
referenced by Joe Blow. This shouldn't be. 

A better idea is to assign each web page a numerical importance rating. If your site, ^1, is 
linked from sites #11, #59, and #182, then your rating, Xi, is determined by adding those ratings 
(and multiplying by a suitable scaling constant). 

xi = C{xu +a;59 +a;i82) 
Similarly the second site's rating is determined by what links to it, as 



X2 — C (Xi37 + X157983 + Xi + Xgye) 
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But this assumes that you already know the ratings of the sites, and that's what you're trying to find! 
Write this in matrix language. Each site is an element in a huge column matrix {sj}. 



iV 

Xj = C ^ ^ cxijXj 



or 




C 



/ 








1 





1 






1 





















1 





1 


1 




V 












■/ 




An entry of 1 indicates a link and a is no link. This is an eigenvector problem with the eigenvalue 
A = l/C, and though there are many eigenvectors, there is a constraint that lets you pick the right 
one. All the XjS must be non-negative, and there's a theorem (Perron-Frobenius) guaranteeing that 
you can find such an eigenvector. This algorithm is a key idea behind Google's ranking methods. They 
have gone well beyond this basic technique of course, but the spirit of the method remains. 
See www-db.stanford.edu/~backrub/google.html for more on this. 

7.14 Special Operators 

Symmetric 

Antisymmetric 

Hermitian 

Antihermitian 

Orthogonal 

Unitary 

Idempotent 

Nilpotent 

Self-adjoint 

In no particular order of importance, these are names for special classes of operators. It is often 
the case that an operator defined in terms of a physical problem will be in some way special, and it's 
then worth knowing the consequent simplifications. The first ones involve a scalar product. 



Symmetric: 
Antisymmetric: 



{u,Siv)) = {Siu),v) 
{u,A{v)) = -{A{u),v) 



The inertia operator of Eq. (7.3) is symmetric. 

I{u) = Jdmrx(ujxf) satisfies (^uji,I{uj2)) = ^i- Ii^^2) = {I{^i),^^2) = H^i) -^^2 
Proof: Plug in. 

Ui ■ I (002) = oJi- J dmf X {u2 X r ) = cDi ■ j dm \uj2 — f{u2 ■ r )] 
dm [ui ■oj2r'^ - [oJi ■ f){u2 ■ r )] = -^(^^i) " ^2 



What good does this do? You will be guaranteed that all eigenvalues are real, all eigenvectors are 
orthogonal, and the eigenvectors form an orthogonal basis. In this example, the eigenvalues are moments 
of inertia about the axes defined by the eigenvectors, so these moments better be real. The magnetic 
field operator (problem 7.28) is antisymmetric. 

Hermitian operators obey the same identity as symmetric: (n, if('u)) = The 
difference is that in this case you allow the scalars to be complex numbers. That means that the scalar 
product has a complex conjugation implied in the first factor. You saw this sort of operator in the 
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chapter on Fourier series, section 5.3, but it didn't appear under this name. You will become familiar 
with this class of operators when you hit quantum mechanics. Then they are ubiquitous. The same 
theorem as for symmetric operators applies here, that the eigenvalues are real and that the eigenvectors 
are orthogonal. 

Orthogonal operators satisfy {0{u),0{v)^ = (u.vY The most familiar example is rotation. 
When you rotate two vectors, their magnitudes and the angle between them do not change. That's all 
that this equation says — scalar products are preserved by the transformation. 

Unitary operators are the complex analog of orthogonal ones: (U{u)^U{v)) = {u,v), but all 
the scalars are complex and the scalar product is modified accordingly. 

The next couple you don't see as often. Idempotent means that if you take the square of the 
operator, it equals the original operator. 

Nilpotent means that if you take successive powers of the operator you eventually reach the 
zero operator. 

Self-adjoint in a finite dimensional vector space is exactly the same thing as Hermitian. In 
infinite dimensions it is not, and in quantum mechanics the important operators are the self-adjoint 
ones. The issues involved are a bit technical. As an aside, in infinite dimensions you need one extra 
hypothesis for unitary and orthogonal: that they are invertible. 
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Problems 

7.1 Draw a picture of the effect of these linear transformations on the unit square with vertices at 
(0, 0), (1, 0), (1, 1), (0, 1). The matrices representing the operators are 

(^)O^)^ (^)O:^)^ iH-r.) 

Is the orientation preserved or not in each case? See the figure at the end of section 7.7 

7.2 Using the same matrices as the preceding question, what is the picture resulting from doing (a) 
followed by (c)? What is the picture resulting from doing (c) followed by (a)? The results of section 
7.4 may prove helpful. 

7.3 Look again at the parallelogram that is the image of the unit square in 
the calculation of the determinant. In Eq. (7.39) I used the cross product to 
get its area, but sometimes a brute-force method is more persuasive. If the 

transformation has components The corners of the parallelogram 

that is the image of the unit square are at (0,0), (a,c), {a + b,c + d), 
{b,d). You can compute its area as sums and differences of rectangles and 
triangles. Do so; it should give the same result as the method that used a 
cross product. 

7.4 In three dimensions, there is an analogy to the geometric interpretation of the cross product as the 
area of a parallelogram. The triple scalar product A - B x C is the volume of the parallelepiped having 
these three vectors as edges. Prove both of these statements starting from the geometric definitions 
of the two products. That is, from the AB cos9 and AB sinO definitions of the dot product and the 
magnitude of the cross product (and its direction). 

7.5 Derive the relation w = a; x r for a point mass rotating about an axis. Refer to the figure before 
Eq. (7.2). 

7.6 You have a mass attached to four springs in a plane and that are in turn attached to four walls as 
on page 3; the mass is at equilibrium. Two opposing spring have spring constant ki and the other two 
are Push on the mass with a (small) force F and the resulting displacement of m \s d = f{F), 
defining a linear operator. Compute the components of / in an obvious basis and check a couple of 
special cases to see if the displacement is in a plausible direction, especially if the two k's are quite 
different. 

7.7 On the vector space of quadratic polynomials, degree < 2, the operator d/dx is defined: the 
derivative of such a polynomial is a polynomial, (a) Use the basis Cq = 1, Ci = x, and 6*2 = 
and compute the components of this operator, (b) Compute the components of the operator /dx"^ . 
(c) Compute the square of the first matrix and compare it to the result for (b). Ans: (a)^ = (b) 

7.8 Repeat the preceding problem, but look at the case of cubic polynomials, a four-dimensional space. 

7.9 In the preceding problem the basis 1, x, x^, x^ is too obvious. Take another basis, the Legendre 
polynomials: 

3 1 5 3 

Pq{x) = 1, Pi{x) = X, P2{x) = -x^ - -, Psix) = -x^ - -X 
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and repeat the problem, finding components of the first and second derivative operators. Verify an 
example explicitly to check that your matrix reproduces the effect of differentiation on a polynomial of 
your choice. Pick one that will let you test your results. 

7.10 What is the determinant of the inverse of an operator, explaining why? 
Ans: 1/ det(original operator) 

7.11 Eight identical point masses m are placed at the corners of a cube that has one corner at the origin 
of the coordinates and has its sides along the axes. The side of the cube is length = a. In the basis that 
is placed along the axes as usual, compute the components of the inertia tensor. Ans: In = 8ma^ 

7.12 For the dumbbell rotating about the off-axis axis in Eq. (7.19), what is the time-derivative of L? 
In very short time dt, what new direction does L take and what then is dL7 That will tell you dL/dt. 
Prove that this is a; x L. 



7.13 A cube of uniform volume mass density, mass m, and side a has one corner at the origin of 
the coordinate system and the adjacent edges are placed along the coordinate axes. Compute the 
components of the tensor of inertia. Do it (a) directly and (b) by using the parallel axis theorem to 
check your result. 

/ 2/3 -1/4 -1/4 
Ans: ma? -1/4 2/3 -1/4 
\-l/4 -1/4 2/3^ 

7.14 Compute the cube of Eq. (7.13) to find the trigonometric identities for the cosine and sine of 
triple angles in terms of single angle sines and cosines. Compare the results of problem 3.9. 

7.15 On the vectors of column matrices, the operators are matrices. For the two dimensional case take 
M = ^) ^^^'^ '^^ components in the basis ^ | ^ ^i^*^ ^ | ^ ■ 

What is the determinant of the resulting matrix? Ans: Mn = [a + b + c + d)/2, and the determinant 
is still ad — be. 



7.16 Show that the tensor of inertia, Eq. (7.3), satisfies a;i-/(ci;2) = I{uji)-id2- What does this 
identity tell you about the components of the operator when you use the ordinary orthonormal basis? 
First determine in such a basis what ei ■/(e2) is. 

7.17 Use the definition of the center of mass to show that the two cross terms in Eq. (7.21) are zero. 

7.18 Prove the Perpendicular Axis Theorem. This says that for a mass that lies flat in a plane, the 
moment of inertia about an axis perpendicular to the plane equals the sum of the two moments of 
inertia about the two perpendicular axes that lie in the plane and that intersect the third axis. 

7.19 Verify in the conventional, non-matrix way that Eq. (7.61) really does provide a solution to the 
original second order differential equation (7.59). 



7.20 The Pauli spin matrices are 



^-=(? o)' ^^=(z o')' ""^=(0 -1) 
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Show that (Tx(Ty = i(Jz and the same for cyclic permutations of the indices x, y, z. Compare the 
products (JxCTy and ayax and the other pairings of these matrices. 

7.21 Interpret a ■ A as CxAx + o'yAy + azAz and prove that 

a-Aa-B = A-B + ia-AxB 
where the first term on the right has to include the identity matrix for this to make sense. 

7.22 Evaluate the matrix 

-^^={I-a.A)-' 
I-a-A ^ 

Evaluate this by two methods: (a) You may assume that A is in some sense small enough for you to 
manipulate by infinite series methods. This then becomes a geometric series that you can sum. Use 
the results of the preceding problem. 

(b) You can manipulate the algebra directly without series. I suggest that you recall the sort of 
manipulation that allows you to write the complex number 1/(1 —i) without any z's in the denominator. 
I suppose you could do it a third way, writing out the 2x2 matrix and explicitly inverting it, but I 
definitely don't recommend this. 

7.23 Evaluate the sum of the infinite series defined by e^^^y^ . Where have you seen this result before? 
The first term in the series must be interpreted as the identity matrix. Ans: I cos 6 — iaySva.6 

7.24 For the moment of inertia about an axis, the integral is J r^dm. State precisely what this m 
function must be for this to make sense as a Riemann-Stieltjes integral, Eq. (1.28). For the case that 
you have eight masses, all rriQ at the 8 corners of a cube of side a, write explicitly what this function 
is and evaluate the moment of inertia about an axis along one edge of the cube. 

7.25 The summation convention allows you to write some compact formulas. Evaluate these, assuming 
that you're dealing with three dimensions. Note Eq. (7.30). Define the alternating symbol e^jf^ to be 
1: It is totally anti-symmetric. That is, interchange any two indices and you change the sign of the 
value. 

2: ei23 = 1- [E.g. £132 = -1, £312 = +1] 

^iit ^ijk^jBki ^ij^ijki ^mnAmBfij SrnnUmVn} f^nVm 

Multiply the last identity by AjBmCn and interpret. 

7.26 The set of Hermite polynomials starts out as 

Ho = 1, Hi = 2x, H2 = - 2, H3 = - 12x, H4 = 16x^ - 48x'^ + 12, 

(a) For the vector space of cubic polynomials in x, choose a basis of Hermite polynomials and compute 
the matrix of components of the differentiation operator, d/dx. 

(b) Compute the components of the operator d"^ / dx"^ and show the relation between this matrix and 
the preceding one. 

7.27 On the vector space of functions of x, define the translation operator 



Taf = g means g{x) = f{x-a) 
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This picks up a function and moves it by a to the right. 

(a) Pick a simple example function / and test this definition graphically to verify that it does what I 
said. 

(b) On the space of cubic polynomials and using a basis of your choice, find the components of this 
operator. 

(c) Square the resulting matrix and verify that the result is as it should be. 

(d) What is the inverse of the matrix? (You should be able to guess the answer and then verify it. Or 
you can work out the inverse the traditional way.) 

(e) What if the parameter a is huge? Interpret some of the components of this first matrix and show 
why they are clearly correct. (If they are.) 

(f) What is the determinant of this operator? 

(g) What are the eigenvectors and eigenvalues of this operator? 

7.28 The force by a magnetic field on a moving charge \s F = qv x B. The operation v x B defines 
a linear operator on v, stated as f{v) = v x B . What are the components of this operator expressed 

in terms of the three components of the vector Bl What are the eigenvectors and eigenvalues of this 
operator? For this last part, pick the basis in which you want to do the computations. If you're not 
careful about this choice, you are asking for a lot of algebra. Ans: eigenvalues: 0, ±.iB 

7.29 In section 7.8 you have an operator M expressed in two different bases. What is its determinant 
computed in each basis? 

7.30 In a given basis, an operator has the values 

A{ei) = ei + 36*2 and A{e2) = 2ei + 4e4 

(a) Draw a picture of what this does, (b) Find the eigenvalues and eigenvectors and determinant o\ A 
and see how this corresponds to the picture you just drew. 

7.31 The characteristic polynomial of a matrix M is det(M — XT). I is the identity matrix and A is 
the variable in the polynomial. Write the characteristic polynomial for the general 2x2 matrix. Then 
in place of A in this polynomial, put the matrix M itself. The constant term will have to include the 
factor / as usual. For this 2x2 case verify the Cayley-Hamilton Theorem, that the matrix satisfies its 
own characteristic equation, making this polynomial in M the zero matrix. 

7.32 (a) For the magnetic field operator defined in problem 7.28, place 5 = 63 along the direction of 

B. Then take ei = {x — iy)/ \f2, 62 = {x + iy)/ \/2 and find the components of the linear operator 
representing the magnetic field, (b) A charged particle is placed in this field and the equations of 
motion are ma = F = qv x B. Translate this into the operator language with a matrix like that of 
problem 7.28, and write F = ma in this language and this basis. Ans: (part) mf'i = —iqBri, where 
ri = {x + iy)/\^. mr2 = +iqBr2, where r2 = {x — iy)/\/2. 

7.33 For the operator in problem 7.27 part (b), what are the eigenvectors and eigenvalues? 

7.34 A nilpotent operator was defined in section 7.14. For the operator defined in problem 7.8, show 
that it is nilpotent. How does this translate into the successive powers of its matrix components? 

7.35 A cube of uniform mass density has side a and mass m. Evaluate its moment of inertia about an 
axis along a longest diagonal of the cube. Note: If you find yourself entangled in a calculation having 



7 — Operators and Matrices 



34 



multiple integrals with hopeless limits of integration, toss it out and start over. You may even find 
problem 7.18 useful. Ans: ma? /Q 

7.36 Show that the set of all 2 x 2 matrices forms a vector space. Produce a basis for it, and so what 
is its dimension? 

7.37 In the vector space of the preceding problem, the following transformation defines an operator. 
f{M) = SMS^^. For S, use the rotation matrix of Eq. (7.13) and compute the components of this 
operator /. The obvious choice of basis would be matrices with a single non-zero element 1. Instead, 
try the basis /, ax, o"y, Cz- Ans: A rotation by 2a about the y-axis, e.g. f{ei ) = Ci cos 2a — 63 sin 2a . 

7.38 What are the eigenvectors and eigenvalues of the operator in the preceding problem? Now you'll 
be happy I suggested the basis that I did. 

7.39 (a) The commutator of two matrices is defined to be = AB — BA. Show that this 
commutator satisfies the Jacobi identity. 

[A, [B,C]] + [B,[C,A]] + [C, [A,B]] = 

(b) The anti-commutator of two matrices is {A, B} = AB + BA. Show that there is an identity like 
the Jacobi identity, but with one of the two commutators (the inner one or the outer one) replaced by 
an anti-commutator. I'll leave it to you to figure out which. 

7.40 Diagonalize each of the Pauli spin matrices of problem 7.20. That is, find their eigenvalues and 
specify the respective eigenvectors as the basis in which they are diagonal. 

7.41 What are the eigenvalues and eigenvectors of the rotation matrix Eq. (7.13)? Translate the answer 
back into a statement about rotating vectors, not just their components. 

7.42 Same as the preceding problem, but replace the circular trigonometric functions in Eq. (7.13) 
with hyperbolic ones. Also change the sole minus sign in the matrix to a plus sign. Draw pictures of 
what this matrix does to the basis vectors. What is its determinant? 

7.43 Compute the eigenvalues and eigenvectors of the matrix Eq. (7.18). Interpret each. 

7.44 Look again at the vector space of problem 6.36 and use the basis /i, /2, /a that you constructed 
there, (a) In this basis, what are the components of the two operators described in that problem? 

(b) What is the product of these two matrices? Do it in the order so that it represents the composition 
of the first rotation followed by the second rotation. 

(c) Find the eigenvectors of this product and from the result show that the combination of the two 
rotations is a third rotation about an axis that you can now specify. Can you anticipate before solving 
it, what one of the eigenvalues will be? 

(d) Does a sketch of this rotation axis agree with what you should get by doing the two original rotations 
in order? 

7.45 Verify that the Gauss elimination method of Eq. (7.44) agrees with (7.38). 

7.46 What is the determinant of a nilpotent operator? See problem 7.34. 

7.47 (a) Recall (or look up) the method for evaluating a determinant using cofactors (or minors). For 
2x2, 3x3, and in fact, for NxN arrays, how many multiplication operations are required for this. 
Ignore the time to do any additions and assume that a computer can do a product in 10"^" seconds. 
How much time does it take by this method to do the determinant for 10x10, 20x20, and 30x30 
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arrays? Express the times in convenient units. 

(b) Repeat this for the Gauss elimination algorithm at Eq. (7.44). How much time for the above three 
matrices and for 100x100 and 1000x1000? Count division as taking the same time as multiplication. 
Ans: For the first method, 30x30 requires lOOOOxage of universe. For Gauss it is Sfis. 

7.48 On the vector space of functions on < x < L, (a) use the basis of complex exponentials, 
Eq. (5.20), and compute the matrix components of d/dx. 

(b) Use the basis of Eq. (5.17) to do the same thing. 

7.49 Repeat the preceding problem, but for d'^/dx'^. Compare the result here to the squares of the 
matrices from that problem. 

7.50 Repeat problem 7.27 but using a different vector space of functions with basis 

(a) e'*^^^/^, {n = 0, ±1, ±2,...) 

(b) cos{n7Tx/L), and sm(m7rx/L). 

These functions will be a basis in the set of periodic functions of x, and these will be very big matrices. 

7.51 (a) What is the determinant of the translation operator of problem 7.27? 
(b) What is the determinant of d/dx on the vector space of problem 7.26? 

7.52 (a) Write out the construction of the trace in the case of a three dimensional operator, analogous 
to Eq. (7.47). What are the coefficients of and e"^? (b) Back in the two dimensional case, draw a 
picture of what (/ + e/) does to the basis vectors to first order in e. 

7.53 Evaluate the trace for arbitrary dimension. Use the procedure of Gauss elimination to compute 
the determinant, and note at every step that you are keeping terms only through and e^. Any higher 
orders can be dropped as soon as they appear. Ans: X]t=i fn 

7.54 The set of all operators on a given vector space forms a vector space.* (Show this.) Consider 
whether you can or should restrict yourself to real numbers or if you ought to be dealing with complex 
scalars. 

Now what about the list of operators in section 7.14. Which of them form vector spaces? 
Ans: Yes(real), Yes(real), No, No, No, No, No, No, No 

7.55 In the vector space of cubic polynomials, choose the basis 

6*0 = 1, ei = l + x, 62 = 1+ x + x"^, 63 = 1 + X + + a;^. 

In this basis, compute the matrix of components of the operator P , where this is the parity operator, 
defined as the operator that takes the variable x and changes it to —x. For example P(ei) = I — x. 
Compute the square of the resulting matrix. What is the determinant of P? If you had only the 
quadratic polynomials with basis eo, Ci, 6*2, what is the determinant? What about linear polynomials, 
with basis Co, Ci? Maybe even constant polynomials? 

7.56 On the space of quadratic polynomials define an operator that permutes the coefficients: f{x) = 
ax'^ + bx + c, then Of = g has g{x) = bx'^ + cx + a. Find the eigenvalues and eigenvectors of this 
operator. 



* If you're knowledgeable enough to recognize the difficulty caused by the question of domains, 
you'll recognize that this is false in infinite dimensions. But if you know that much then you don't need 
to be reading this chapter. 
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7.57 The results in Eq. (7.36) is a rotation about some axis. Where is it? Notice that a rotation about 
an axis leaves the axis itself alone, so this is an eigenvector problem. If it leaves the vector alone, you 
even know what the eigenvalue is, so you can easily find the vector. Repeat for the other rotation, 
found in Eq. (7.37) Ans: Ci + 62 — 63 

7.58 Find the eigenvectors and eigenvalues of the matrices in problem 7.1. 



